Analyzing Ola Data for Predicting Price Based Trip Distance Using Random Forest and Linear Regression Analysis


Abstract

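The paper aims to create an efficient and accurate cab fare prediction system by using two machine learning algorithms and comparing them. The algorithms are the Random Forest algorithm and the Linear Regression algorithm, compared on the R-squared, mean squared error (MSE), root MSE (RMSE), and root mean squared logarithmic error (RMSLE) values. We implement Random Forest and Linear Regression to predict the prices of trips and to find which gives the best accuracy when both algorithms are compared. The price should be predicted before the trip starts. The sample size considered for this work is N=10 for each of the groups considered. In total, the analysis on price prediction was iterated 20 times, with a G-power of 80%, a threshold of 0.05, a CI of 95%, and standard deviation. The sample size calculation was done using ClinCalc. Based on the statistical analysis, the significance value for the R-squared calculation was found to be 0.034. One algorithm gives a slightly better accuracy rate, with an R-squared percentage of 71.67%, while the other has 70.57%. Through this process, the two algorithms are compared for online cab rental price prediction.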
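The four metrics named in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the distance and fare values are hypothetical toy numbers, the helper names (`mse`, `rmse`, `rmsle`, `r_squared`, `fit_line`) are made up for illustration, and only a single-feature least-squares line is fitted here. Predictions from a Random Forest model could be scored with the same functions.

```python
import math

def mse(y_true, y_pred):
    """Mean squared error."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def rmse(y_true, y_pred):
    """Root mean squared error."""
    return math.sqrt(mse(y_true, y_pred))

def rmsle(y_true, y_pred):
    """Root mean squared logarithmic error (values must be non-negative)."""
    total = sum((math.log1p(p) - math.log1p(t)) ** 2
                for t, p in zip(y_true, y_pred))
    return math.sqrt(total / len(y_true))

def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

def fit_line(x, y):
    """Ordinary least squares for one feature: y ~ slope * x + intercept."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

# Hypothetical toy data (not from the paper): trip distance and fare.
distance_km = [1.0, 2.0, 3.0, 4.0, 5.0]
fare = [52.0, 71.0, 88.0, 112.0, 127.0]

slope, intercept = fit_line(distance_km, fare)
predicted = [slope * d + intercept for d in distance_km]

print("R^2  :", r_squared(fare, predicted))
print("MSE  :", mse(fare, predicted))
print("RMSE :", rmse(fare, predicted))
print("RMSLE:", rmsle(fare, predicted))
```

Scoring on the training data, as done here for brevity, overstates accuracy; a held-out test split would be the more usual choice when comparing two models.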


Similar resources

HIERARCHICAL DATA CLUSTERING MODEL FOR ANALYZING PASSENGERS’ TRIP IN HIGHWAYS

One of the most important issues in urban planning is developing sustainable public transportation. The basic condition for this purpose is analyzing the current situation, especially based on data. Data mining is a set of new techniques that go beyond statistical data analysis. Clustering is a subset of these techniques, and one clustering technique is used for analyzing passengers’ trips. The result of...


Longitudinal Discriminant Analysis with Random Effects for Predicting Preeclampsia using Hematocrit Data

Background and Objectives: Preeclampsia is the third leading cause of death in pregnant women. This study was conducted to evaluate the ability of longitudinal hematocrit data to predict preeclampsia and to compare the accuracy in longitudinal and cross-sectional data. Materials and Methods: In a prospective cohort study from October 2010 to July 2011, 650 pregnant women referred to the prenata...


Classification and Biomarker Genes Selection for Cancer Gene Expression Data Using Random Forest

Background & objective: Microarray and next generation sequencing (NGS) data are the important sources to find helpful molecular patterns. Also, the great number of gene expression data increases the challenge of how to identify the biomarkers associated with cancer. The random forest (RF) is used to effectively analyze the problems of large-p and smal...


Variable Importance Assessment in Regression: Linear Regression versus Random Forest

Relative importance of regressor variables is an old topic that still awaits a satisfactory solution. When interest is in attributing importance in linear regression, averaging over orderings methods for decomposing R² are among the state-of-the-art methods, although the mechanism behind their behavior is not (yet) completely understood. Random forests, a machine-learning tool for classification a...


Determining the Capability of ASTER Satellite Data and the Classification and Regression Tree and Random Forest Algorithms for Forest Type Mapping

Recognizing uniform units, separating them, and then planning per unit is the most basic method for managing forest units. The aim of this study is to present and compare the classification and regression tree (CART) and random forest (RF) algorithms for forest type mapping using ASTER satellite data in district one of the Darabkola educational and research forest. At the start, using an inventory network of 500 × 350 m...



Journal

Journal title: Advances in Parallel Computing

Year: 2022

ISSN: 1879-808X, 0927-5452

DOI: https://doi.org/10.3233/apc220086